Multi-agent Q-learning and Regression Trees for Automated Pricing Decisions

Authors

  • Manu Sridharan
  • Gerald Tesauro
Abstract

We study the use of single-agent and multi-agent Q-learning to learn seller pricing strategies in three different two-seller models of agent economies, using a simple regression tree approximation scheme to represent the Q-functions. Our results are highly encouraging: regression trees match the training times and policy performance of lookup table Q-learning, while offering significant advantages in storage size and amount of training data required, and better expected scaling to large numbers of agents. Clear advantages are seen over neural networks, which yield inferior policies and require much longer training times. Our work is among the first to demonstrate success in combining Q-learning with regression trees. Also, with regression trees, Q-learning appears much more feasible as a practical approach to learning strategies in large multi-agent economies.
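As a rough illustration of the idea in the abstract, the sketch below represents a Q-function with a small regression tree and trains it by repeated batch Q-learning sweeps. Everything in it is an assumption made for illustration and is not taken from the paper: the five-point price grid, the `profit` rule, the myopic `rival_reply` opponent, the discount `GAMMA`, and the hand-rolled `fit_tree`/`predict` helpers. The authors' actual economic models and tree-growing procedure may differ.

```python
# Hypothetical toy setup (not from the paper): two sellers pick prices from a grid.
PRICES = [0.2, 0.4, 0.6, 0.8, 1.0]
N = len(PRICES)
GAMMA = 0.9  # assumed discount factor

def profit(a, s):
    """Our profit when we post PRICES[a] against the rival's PRICES[s]."""
    if PRICES[a] < PRICES[s]:
        return PRICES[a]          # we undercut and win the sale
    if PRICES[a] == PRICES[s]:
        return PRICES[a] / 2      # tie: the sale is split
    return 0.0                    # rival is cheaper

def rival_reply(a):
    """Assumed myopic rival: undercut us by one step, or reset to the top price."""
    return a - 1 if a > 0 else N - 1

def sse(vals):
    """Sum of squared errors of vals around their mean."""
    m = sum(vals) / len(vals)
    return sum((v - m) ** 2 for v in vals)

def fit_tree(X, y, depth=0, max_depth=6, min_leaf=1):
    """Greedy regression tree: a leaf is a float, a node is (feature, threshold, left, right)."""
    mean = sum(y) / len(y)
    if depth >= max_depth or len(y) < 2 * min_leaf:
        return mean
    best = None
    for f in range(len(X[0])):
        for t in sorted(set(x[f] for x in X))[:-1]:
            left = [yi for xi, yi in zip(X, y) if xi[f] <= t]
            right = [yi for xi, yi in zip(X, y) if xi[f] > t]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            cost = sse(left) + sse(right)
            if best is None or cost < best[0]:
                best = (cost, f, t)
    if best is None:
        return mean
    _, f, t = best
    lo = [(xi, yi) for xi, yi in zip(X, y) if xi[f] <= t]
    hi = [(xi, yi) for xi, yi in zip(X, y) if xi[f] > t]
    return (f, t,
            fit_tree([x for x, _ in lo], [v for _, v in lo], depth + 1, max_depth, min_leaf),
            fit_tree([x for x, _ in hi], [v for _, v in hi], depth + 1, max_depth, min_leaf))

def predict(tree, x):
    """Walk from the root to a leaf and return the leaf value."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if x[f] <= t else right
    return tree

# One sample per (rival price, our price) pair: (state, action, reward, next state).
samples = [(s, a, profit(a, s), rival_reply(a)) for s in range(N) for a in range(N)]
X = [(s, a) for s, a, r, s2 in samples]

tree = 0.0  # start with Q = 0 everywhere (a single leaf)
for sweep in range(30):
    # Q-learning targets computed from the current tree, then a fresh tree is fit to them.
    y = [r + GAMMA * max(predict(tree, (s2, b)) for b in range(N))
         for s, a, r, s2 in samples]
    tree = fit_tree(X, y)

# Greedy price index for each possible rival price.
policy = [max(range(N), key=lambda a, s=s: predict(tree, (s, a))) for s in range(N)]
print(policy)
```

The tree here only approximates the lookup table, since `max_depth` caps how finely it can partition the state-action grid; that trade-off between storage and fidelity is the kind of advantage the abstract attributes to regression trees.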


Similar resources

User-based Vehicle Route Guidance in Urban Networks Based on Intelligent Multi Agents Systems and the ANT-Q Algorithm

Guiding vehicles to their destination under dynamic traffic conditions is an important topic in the field of Intelligent Transportation Systems (ITS). Nowadays, many complex systems can be controlled using multi-agent systems. Adaptation to current conditions is an important feature of the agents. In this research, formulation of dynamic guidance for vehicles has been investigated based...


An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources

This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage employs a PID controller whose parameters are designed using sine cosine optimization (SCO...


Voltage Coordination of FACTS Devices in Power Systems Using RL-Based Multi-Agent Systems

This paper describes how multi-agent system technology can be used as the underpinning platform for voltage control in power systems. In this study, some FACTS (flexible AC transmission systems) devices are properly designed to coordinate their decisions and actions in order to provide a coordinated secondary voltage control mechanism based on multi-agent theory. Each device here is modeled as ...


Agent-Based Modeling of Day-Ahead Real Time Pricing in a Pool-Based Electricity Market

In this paper, an agent-based structure of the electricity retail market is presented, based on which day-ahead (DA) energy procurement for customers is modeled. Here, we focus on the operation of a single Retail Energy Provider (REP) agent that purchases energy from the DA pool-based wholesale market and offers DA real-time tariffs to a group of its customers. As a model of customer response to the offe...


A bi-level programming approach to coordinating pricing and ordering decisions in a multi-channel supply chain

This paper investigates the Stackelberg equilibrium for pricing and ordering decisions in a multi-channel supply chain. We study a situation where a manufacturer is going to open a direct online channel in addition to n existing traditional retail channels. It is assumed that the manufacturer is the leader and the retailers are the followers. The situation has a hierarchical nature and...



Publication date: 2000